Probabilistic Latent Tensor Factorization for 3-way Microarray Data Analysis with Missing Values

Authors

  • Umut Şimşekli
  • Yunus Emre Kara
  • Arzucan Özgür
  • Ali Taylan Cemgil
Abstract

Recent advances in microarray technology have enabled the measurement of gene expression levels of samples over a series of time points. Unlike traditional 2D microarray data, such experiments generate 3D (gene-sample-time) microarray data, which require specialized methods for analysis. In this study, we propose a novel tensor factorization model for 3D microarray data. The model assumes the existence of certain temporal patterns that are repeated over time. One main advantage of the model is that it handles missing data implicitly, so the estimation process is not affected by the missing values that commonly occur in microarray data. We evaluate our model on the classification of good and bad responders to Interferon beta (INFβ) treatment, using a real gene-sample-time microarray data set, and achieve promising prediction performance.
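To make the idea of handling missing data implicitly more concrete, the following is a minimal sketch, not the authors' model. The abstract gives no equations, so the sketch assumes a plain nonnegative CP (PARAFAC) factorization of a gene-sample-time tensor fitted with KL-divergence multiplicative updates, and it does not model the repeated temporal patterns mentioned above. The function names `masked_cp_kl` and `masked_kl`, the rank, and the synthetic data are all hypothetical. A binary mask restricts every update to the observed entries, so whatever is stored at a missing position has no influence on the estimate.

```python
import numpy as np

def reconstruct(A, B, C):
    # X_hat[g, s, t] = sum_r A[g, r] * B[s, r] * C[t, r]
    return np.einsum("gr,sr,tr->gst", A, B, C)

def masked_kl(X, mask, A, B, C, eps=1e-12):
    # Generalized KL divergence summed over observed entries only.
    Xhat = reconstruct(A, B, C)
    Xo = np.where(mask, X, 1.0)  # placeholder at missing entries, excluded below
    d = Xo * np.log((Xo + eps) / (Xhat + eps)) - Xo + Xhat
    return float(d[mask].sum())

def masked_cp_kl(X, mask, rank, n_iter=200, seed=0, eps=1e-12):
    # Assumption: nonnegative data and a plain CP structure (illustrative only).
    rng = np.random.default_rng(seed)
    G, S, T = X.shape
    A = rng.random((G, rank)) + 0.1  # gene factors
    B = rng.random((S, rank)) + 0.1  # sample factors
    C = rng.random((T, rank)) + 0.1  # time factors
    M = mask.astype(float)
    Xo = np.where(mask, X, 0.0)      # missing entries (even NaN) contribute nothing
    for _ in range(n_iter):
        R = Xo / (reconstruct(A, B, C) + eps)
        A *= np.einsum("gst,sr,tr->gr", R, B, C) / (np.einsum("gst,sr,tr->gr", M, B, C) + eps)
        R = Xo / (reconstruct(A, B, C) + eps)
        B *= np.einsum("gst,gr,tr->sr", R, A, C) / (np.einsum("gst,gr,tr->sr", M, A, C) + eps)
        R = Xo / (reconstruct(A, B, C) + eps)
        C *= np.einsum("gst,gr,sr->tr", R, A, B) / (np.einsum("gst,gr,sr->tr", M, A, B) + eps)
    return A, B, C

# Synthetic example: a rank-2 tensor with roughly 30% of entries marked missing.
rng = np.random.default_rng(1)
X_true = reconstruct(rng.random((6, 2)), rng.random((5, 2)), rng.random((4, 2)))
observed = rng.random(X_true.shape) > 0.3
X_obs = np.where(observed, X_true, np.nan)  # NaN marks a missing measurement
A, B, C = masked_cp_kl(X_obs, observed, rank=2)
```

In a classification setting like the one described in the abstract, one plausible (assumed) use of such a factorization is to treat each row of the sample factor matrix `B` as a feature vector for that sample.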

Similar resources

Variational Inference For Probabilistic Latent Tensor Factorization with KL Divergence

Probabilistic Latent Tensor Factorization (PLTF) is a recently proposed probabilistic framework for modelling multi-way data. Not only the common tensor factorization models but also any arbitrary tensor factorization structure can be realized by the PLTF framework. This paper presents full Bayesian inference via variational Bayes that facilitates more powerful modelling and allows more sophist...

Probabilistic Tensor Factorization for Tensor Completion

Multi-way tensor datasets emerge naturally in a variety of domains, such as recommendation systems, bioinformatics, and retail data analysis. The data in these domains usually contains a large number of missing entries. Therefore, many applications in those domains aim at missing value prediction, which boils down to a tensor completion problem. While tensor factorization algorithms can be a po...

Scalable Probabilistic Tensor Factorization for Binary and Count Data

Tensor factorization methods provide a useful way to extract latent factors from complex multirelational data, and also for predicting missing data. Developing tensor factorization methods for massive tensors, especially when the data are binary or count-valued (which is true of most real-world tensors), however, remains a challenge. We develop a scalable probabilistic tensor factorization frame...

Liver CT Annotation via Generalized Coupled Tensor Factorization

This study deals with the missing answers prediction problem. We address this problem using coupled analysis of the ImageCLEF2014 dataset by representing it as heterogeneous data, i.e., a dataset in the form of matrices. We propose to use an approach based on a probabilistic interpretation of tensor factorization models, i.e., Generalized Coupled Tensor Factorization, which can simultaneously fit a l...

Multi-HDP: A Non Parametric Bayesian Model for Tensor Factorization

Matrix factorization algorithms are frequently used in the machine learning community to find low dimensional representations of data. We introduce a novel generative Bayesian probabilistic model for unsupervised matrix and tensor factorization. The model consists of several interacting LDA models, one for each modality. We describe an efficient collapsed Gibbs sampler for inference. We also de...

Publication date: 2012